Modeling Acquisition of Word Structure with Lexicalized Grammar Learning

Author

  • Çağrı Çöltekin
Abstract

This paper introduces a framework for learning structure in natural languages, and reports results from a simple application of it to learning the word-syntax of an agglutinative language in an unsupervised manner. Arguably, the learning environment of children acquiring languages provides more information (through linguistic interaction and the extralinguistic information present in the learning setting) than the information provided to an unsupervised learner. However, completely unsupervised learning methods can still provide insights into how children acquire language, at least (i) by setting a lower bound on what is learnable, (ii) by identifying the type and quantity of cues in the input that are useful for successful learning, and (iii) by testing different learning methods, algorithms and frameworks on the basis of how successfully they learn from the data available to children and how well they match the available data from developmental psycholinguistics. In this paper, we first describe the general learning framework, which is based on learning a lexicalized grammar, Categorial Grammar (CG; Ajdukiewicz, 1935; Bar-Hillel, 1953). We then present our morphological learner in more detail, followed by the results obtained by testing the learner on the morphology of Turkish, using child-directed speech from the CHILDES database. The learning algorithm uses techniques similar to those of unsupervised morphology learning systems such as Goldsmith (2001) and Creutz and Lagus (2007). However, this study tries to model human language acquisition more closely by using data from child-directed speech and by not assuming that the complete data is available to the learner. Another major difference of this study is its emphasis on structure learning. The model presented here learns a lexicalized word-grammar, which has similarities to other lexicalized grammar learners (e.g., Villavicencio, 2002; Zettlemoyer & Collins, 2005; Yao, Ma, Duarte, & Çöltekin, 2009).
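As a rough illustration of what a lexicalized word-grammar in Categorial Grammar might look like, the following Python sketch assigns categories to morphemes and combines them by function application. The toy lexicon, the category labels (N, Nloc), the segmentation ev-ler-de, and the function names are assumptions made for this example only; they are not taken from the paper and do not represent its learning algorithm, which is not described in the abstract.

```python
from typing import Dict, List, Optional, Tuple, Union

# A category is an atomic label (str) or a tuple (result, slash, argument).
# ("N", "\\", "N") stands for N\N: it takes an N on its left and yields an N.
Category = Union[str, Tuple["Category", str, "Category"]]


def apply_pair(left: Category, right: Category) -> Optional[Category]:
    """Combine two adjacent categories by function application, if possible."""
    # Forward application: X/Y followed by Y gives X.
    if isinstance(left, tuple) and left[1] == "/" and left[2] == right:
        return left[0]
    # Backward application: Y followed by X\Y gives X.
    if isinstance(right, tuple) and right[1] == "\\" and right[2] == left:
        return right[0]
    return None


# Hypothetical toy lexicon (an assumption for illustration, not learned output).
LEXICON: Dict[str, Category] = {
    "ev": "N",                    # noun stem
    "ler": ("N", "\\", "N"),      # plural suffix: N\N
    "de": ("Nloc", "\\", "N"),    # locative suffix: Nloc\N
}


def derive(morphemes: List[str]) -> Optional[Category]:
    """Combine morphemes left to right; return the word's category or None."""
    if not morphemes or any(m not in LEXICON for m in morphemes):
        return None
    current: Optional[Category] = LEXICON[morphemes[0]]
    for morpheme in morphemes[1:]:
        current = apply_pair(current, LEXICON[morpheme])
        if current is None:
            return None  # no application rule fits this sequence
    return current


if __name__ == "__main__":
    print(derive(["ev", "ler", "de"]))  # stem + plural + locative
    print(derive(["de", "ev"]))         # suffix before stem: no derivation
```

In this sketch all word-structure knowledge lives in the lexicon: a suffix's category states what it attaches to and what it produces, so the ordering of morphemes follows from the categories rather than from separate rules.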


Similar resources


Lexicalized Grammar Acquisition

This paper presents a formalization of automatic grammar acquisition that is based on lexicalized grammar formalisms (e.g. LTAG and HPSG). We state the conditions for the consistent acquisition of a unique lexicalized grammar from an annotated corpus.


The effect of Code switching on the Acquisition of Object Relative Clauses by Iranian EFL Learners

This study attempted to investigate the impact of teacher’s code-switching on the acquisition of a problematic grammatical structure, namely, object relative clauses, by intermediate EFL learners. Moreover, a secondary objective of the study was to determine the EFL learners’ attitudes and opinions regarding the effectiveness of teacher’s code-switching in their learning of a specific aspect of...


Separating Surface Order and Syntactic Relations in a Dependency Grammar

This paper proposes decoupling the dependency tree from word order, such that surface ordering is not determined by traversing the dependency tree. We develop the notion of a word order domain structure, which is linked but structurally dissimilar to the syntactic dependency tree. The proposal results in a lexicalized, declarative, and formally precise description of word order; features which ...


A model of syntactic disambiguation based on lexicalized grammars

This paper presents a new approach to syntactic disambiguation based on lexicalized grammars. While existing disambiguation models decompose the probability of parsing results into that of primitive dependencies of two words, our model selects the most probable parsing result from a set of candidates allowed by a lexicalized grammar. Since parsing results given by the lexicalized grammar cannot...



Publication date: 2009